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To comply with growing demand for high effluent quality of Domestic 
Wastewater Treatment Plant (WWTP), a simple and reliable prediction 
model is thus needed. The wastewater treatment technology considered in 
this paper is an Aerobic Granular Sludge (AGS). The AGS systems are 
fundamentally complex due to uncertainty and non-linearity of the system 
makes it hard to predict. This paper presents model predictions and 
optimization as a tool in predicting the performance of the AGS. The 
input-output data used in model prediction are (COD, TN, TP, AN, and 
MESS). After feature analysis, the prediction of the models using Support 
Vector Machine (SVM) and Eeed-Eorward Neural Network (PENN) are 
developed and compared. The simulation of the model uses the experimental 
data obtained from Sequencing Batch Reactor under hot temperature of 50°C. 
The simulation results indicated that the SVM is preferable to PENN and it 
can provide a useful tool in predicting the effluent quality of WWTP. 
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1. INTRODUCTION 

Wastewater is the combination of the water or liquid carried trashes removed from residential area, 
commercial and industrial wastes together with groundwater and surface water as may be present. This 
wastewater is then treated at domestic wastewater treatment system which also known as wastewater 
treatment plant (WWTP). About (99%) ninty-nine percent water by weight and is basically reffered to as 
influent as it enters the WWTP facilty. The wastewater has three stages namely as mechanical treatment 
(primary treatment), biological treatment (secondary treatment) and chemical treatment (tertiary treatment). 
Wastewater flow been treated through all the stages of treatment before it is allowing to convey back to the 
consumer. This treated water is generally reffered as effluent as it flows out the WWTP facility. Wastewater 
treatment system involved complex process and exhibit nonlinear behaviour that resulted in time consuming 
and effort to develop detailed theoretical dynamic models. The main challenge of modelling and control of an 
AGS is it involved complexity of physical, biological and chemical treatment as it is involved with internal 
interactions between process variable and sludge characteristics. Most of the current available models 
(mathematical models) are complex for the system. 

In the past, some deterministic methods such as activated sluge models (ASMl, ASM2, and ASM3) 
were developed to describe the mass balance of biomass (microbiology) activities in treatment process [1]. 
There is an assumptions made in these mathematical models mainly based on engineering principles such 
that models may be highly effective for the prediction of nitrogenous substrate removal and carbonaceous in 
waswater treatment plant system. However, due to these models are high dimensional and contain an 
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outsized number of stoichiometric and kinetic parameters that should be determined by specialist process 
operation, they may not be practical for on-line plant control. Besides, it may not provide good prediction of 
plant if other current process such as sedimentation are not included in the overall plant evaluation. 

A popular Artificial Intelligent (AI) such as artificial neural network (ANN) methods has been 
widely used for prediction for WWTP due to its good capability in modelling and prediction of process 
performance [2]. However, compared with multiple linear regression or other conventional methods [3-5], 
ANN methods shows high accuracy but large number of data is needed as some shortcomings will appeare 
from its theoretical statical basis for instance accuracy of the prediction is not satisfying when the training 
data set is small. As in against Support Vector Machine (SVM), has been proposed to solve this type of 
problems as SVM is based on statistical learning theory [6]. SVM has several merits compared to ANN such 
as efficient utilization of high-dimensional feature space, distinctively in solvable optimization problem and 
has a good ability in theoretical analysis using computional learning theory [4]. 

As a highly approach for the model with limited training samples set, the SVM has been applied in 
many field such prediction [7], patern recognition problem [8], regression [9, 10], classification [11] and time 
series analysis [11, 12]. In addition, SVM has been reported outperformed traditional statistical learning 
methods such as ANN. Thus, more researcher are focusing on this method as SVM has been arousing more. 
Eor example, in order to make a beginner learner to understand SVM regression and classification were 
reviewed by Smola et al. [13] and Burges et al. [14], respectively. In addition, a library for SVM has been 
develop by [15] in order to help beginners to easily apply SVM according to their field of application. 
Eurthermore, Noble described the explanation of SVM and its biological applications [16]. Specifically, Xu 
et al. adopted SVM for chemometrics in term of classification [17]. While Jia. et al. showed that SVM was 
able to predict the synthesis characteristic of hydraulic valve in industrial production [8]. Other than that, 
Liang et al. had done an effective approach on SVM for content based on sketch retrival [18]. Evaluation of 
using various method such as wavelets, PC A and SVM has been proposed by Gumus et al [19]. All these 
studies show all the application and had make a good contribution of SVM. 

The example of implementation of SVM with limited dataset in WWTP can be found in [20], which 
developed SVM model to study the relationship between membrane bioreactor and affecting factors that 
cause fouling. The study reported that SVM model can assure a good prediction result compared to RBENN 
model since the depending of the training dataset for SVM is lesser than RBENN. Other than that, W. Li Juan 
et al. [21] done a research on predicting effluent quality of WWTP plant with cyclic activated sludge process 
(ASP) using SVM model. The SVM model was reported to gives a high accuracy in predicting quality 
effluent with small learning ability and have a good generalization. X. Xi et al. [22], presented SVM model 
that was applied to predict the permeate flux and rejection of Bovine serum albumin (BSA). The result shows 
SVM model more accurate than ANN model. This is due to ANN model need to rely on the ones expertise 
and experience while SVM is based on statistical theory. K. Gao et al. [23], developed SVM model for 
membrane permeate flux during dead-end microfiltration of activated sludge reactor. The SVM model were 
then compared with BPNN model shows SVM model obtained higher accuracy than BPNN. The researcher 
concluded that in small sample size data, SVM yield better accuracy than BPNN model. This is because 
SVM has a rigorous foundation either in mathematical or theoretical and it based on statistical theory despite 
BPNN that needs to rely on the designer knowledge. Hence, that’s make SVM model only need small 
number of data to achieved higher prediction performance. Therefore, this paper compares the SVM and 
PENN methods in term of their accuracy performance of the model using a limited experimental dataset of 
SBR aerobic granular sludge nutrient removal process. The aim of this paper is to minimize the correlation 
and mean squeare error of the training dataset. The 60 days’ data (with 21 samples) under the temperature of 
50oC is utilized to develop the model for AGS. 

This paper consists of five sections and are organized as follows. The first section will introduce on 
background study and relavent literarature on previous study on the proposed method. The second section 
will be presented SVM and PENN structure with the flow of the modelling method. The detailed of the 
experimental setup of this study is explained in the third section. In the fouth section is the result and 
discussion. The result is compared and discussed in this section. The fifth section gives conclusion and some 
future works. 


2. EXPERIMENT SETUP 

Experiment setup in this study were carried out in three double-walled cylindrical column bioreactor 
with internal diameter of 6.5 cm and height of 100 cm. The sequencing bioreactor sequence (SBR) such as 
aerator pump, influent feeding and effluent discharge were controlled by programmable logic controller 
(PEC). The working temperature for the bioreactor was set to 30, 40 and 50 ± 1°C and controlled using 
thermostat and water bath sleeves. The schematic diagram of the AGS system is shown in Pigure 1. 
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Figure 1. Schematic diagram of laboratory scale aerobic granular sludge bioreactor 


In this work, the SBR operated at 50 ± 1°C is considered. The influent parameters that used as input 
of the model are chemical oxygen demand (COD), total nitrogen (TN), total phosphorus (TP), amnonia 
nitrogen (AN) and mixed liquor suspended solid (MLSS) while the same influent parameters are considered 
as an effluent or output of the model (COD, TN, TP, AN and MLSS). Figure 2 shows (a) Total Phosphorus 
(TP), Total Nitrogen (TN), and Amonia Nitrogen (AN) influent dataset, (b) Total Phosphorus (TP), Total 
Nitrogen (TN), and Amonia Nitrogen (AN) effluent dataset, (c) Chemical Oxygen Demand (COD) 
concentration dataset, (d) Mixed Liquor Suspended Solid (MLSS) dataset. 
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(c) (d) 

Eigure 2. (a) Total Phosphorus (TP), Total Nitrogen (TN), and Amonia Nitrogen (AN) influent dataset, (b) 
Total Phosphorus (TP), Total Nitrogen (TN), and Amonia Nitrogen (AN) effluent dataset, (c) Chemical 
Oxygen Demand (COD) concentration dataset, (d) Mixed Liquor Suspended Solid (MESS) dataset 


3. RESULTS METHOD 

Support Vector Machine (SVM) is first introduced by Vapnik [24] which is based on the idea of 
structural risk management. SVM is basically a supervised learning method that is capable in generating 
predictive models and used for classifiying unseen patterns into their categories. Quite recently, SVM method 
has been extended in solving function estimation and regression problem. The algorithm of SVM method is 
briefly explained in the following section. 

3.1. Support vector machine 

The basic theory of Support Vector Machine (SVM) regression function is expressed as in (1) [9]. 

y=f (x)= (o.x+b ( 1 ) 

where y is scalar output, v is a weight vector, v is multivariate input and h is bias. By introducing slack 
variables, and f the SVM model can be expressed by (2) and (3). 

Minimize (pico) = ^\\(o\\^ - C Yd=i(^i - ' C > 0 (2) 

( y — wx — b < e + fi, 

wx - b - y < e + = 1 , 2 ,.., (3) 

> 0 , 

The new predicted SVM for each x' is determined by evaluating (4) 
y = ^ )x, .x' + b 


3.1.1. Validation of SVM model 

The division processes of normalized sequencing batch reactor data are determined by three folds 
cross validation concept based on two folds for training and onefold for testing. The cross validation is to 
access the performance of the SVM model besides checking the suitable parameters of cost of penalty, and 
gamma, value to be used. The best model is selected based on the accuracy performance correlation (R2) and 
mean square error (MSB) value. Eigure 3 shows the flowchart of SVM method. 
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Figure 3. Flow chart of SVM prediction model 


3.2. Feed-forward neural network 

Basic theory of Feed forward neural network (FFNN) is the combination multilayer network. First 
layer is where the input from external input received. The second layer will be connected from the previous 
layer neurons. This layer will continue until it reached the final output layer neuron. Figure 4 shows the basic 
FFNN structure. 



Figure 4. Feed forward neural network structure 
This network is represented by (5); 

L 

y' = - cc~)Xi.x' + b 


( 5 ) 
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Where is the model output of EENN, F. is a function of the network, W- and W- is the connection 

layer of biases and weight, (p is input layer. Eigure 5 shows the flowchart of EENN method. 
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Eigure 5. Elow chart of SVM prediction model 


4. RESULT AND DISCUSSION 

In general, both SVM and ANN techniques can provide good results in training and testing, 
although in terms of accuracy of the models, SVM is preferable to ANN. The model prediction of AGS 
system considered in this work are COD, TP, AN and AN variables. Eigure 6(a) shows the COD evaluation 
results during training dataset. The SVM method predicted a good results as it achived 99.99% for R2 and 
0.001 for MSE while EENN achived slightly lower than SVM which is 92.52% for R2 and 0.0047 for MSE. 
In COD estimation for testing data, SVM gives higher accurate result compared with EENN method for all 
evaluation criterion. Eigure 6(b) shows the comparison prediction result performace of COD for testing 
where SVM result score are 99.84% and 0.0 while EENN score are 96.44% and 0.0021 for R2 and MSE 
value, respectively. The COD estimation for overall training and testing performance result as shown in 
Table 1. 

Results for TP training data shows good accuracy performance for both model techniques. The R2 
result for both model SVM and EENN achieved more than 90%. The R2 value for SVM and EENN are 
96.34% and 93.36% while MSE are 0.0017 and 0.0047 respectively. Eigure 7(a) shows the comparison graph 
of TP for both model SVM and EENN. The testing result for TP estimation shows higher prediction value for 
SVM model compared to EENN model. The scored of both testing value for R2 are 99.73% and 95.46% 
while MSE results are 0.0003 and 0.0044 respectively. Eigure 7(b) shows the testing data result. The 
performance results of TP for both training and testing are shown in Table 2. 
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(a) 


(b) 


Figure 6. (a) COD training result for SVM and ANN, and (b) COD testing result for SVM and ANN 


Table 1. COD model evaluation for training and t esting T able 2. TP testing result for SVM and ANN 


Model _ Training _ Testing _ Model _ Training _ Testing 



R\%) 

MSE 

R\%) 

MSE 


R\%) 

MSE 

R\%) 

MSE 

SVM 

99.99 

0.0001 

99.84 

0.0000 

SVM 

96.34 

0.0017 

99.73 

0.0003 

FFNN 

92.53 

0.0047 

96.44 

0.0021 

EENN 

93.36 

0.0047 

95.46 

0.0044 




(a) (b) 

Figure 7. (a) TP training result for SVM and ANN, (b) TP testing result for SVM and ANN 


The TN results show good prediction result for SVM method compared to FFNN method. The R2 
for training and testing of SVM method both achieved 90% above and MSE 0.0006 while for FFNN method, 
the R2 and MSE is slightly lower prediction with 75.31% and 0.0022 respectively. During testing, R2 and 
MSE achieved 95.69 and 0.002. Figure 8(a) and 8(b) show an overall performance for both methods. Table 3 
presents the performance comparison of both methods. 


Bulletin of Electr Eng and Inf, Vol. 8, No. 4, December 2019 : 1392 - 1401 








































Bulletin of Electr Eng and Inf 


ISSN: 2302-9285 


n 1399 



(a) (b) 

Eigure 8. (a) TN training result for SVM and ANN, (b) TN testing result for SVM and ANN 


The result for AN estimation shows acceptable accuracy from both methods. During AN testing, 
SVM methods achieved 95.09% and 0.0036 for R2 and MSE value, respectively. Eor EENN, the achieved R2 
and MSE are 86.06% and 0.0101, respectively. During testing, SVM achieved the R2 of 90.12% and MSE of 
0.0012 which is more accurate than EENN with R2 of 80.48% and MSE of 0.0023. Eigure 9(a) and 9(b) show 
the AN output response between SVM and EENN during training and testing. Table 4 indicates an overall 
performance for both methods. 




(a) (b) 

Eigure 9. (a) AN training result for SVM and ANN, (b) AN testing result for SVM and ANN 

T able 3. TN testing result for SVM and AN N Ta ble 4. AN testing result for SVM and AN N 


Model 

Training 

Testing 

Model 

Training 

Testing 


R^(%) 

MSE 

R^(%) 

MSE 


R^(%) 

MSE 

R^(%) 

MSE 

SVM 

96.34 

0.0017 

99.45 

0.0003 

SVM 

95.09 

0.0036 

90.12 

0.0012 

FFNN 

93.36 

0.0047 

95.69 

0.0020 

EENN 

86.06 

0.0101 

80.48 

0.0023 


5. CONCLUSION 

This paper presents the modelling techniques for aerobic granular sludge using SVM and compared 
with feed-forward neural network (EENN). Erom the simulation results, it can be observed that the training 
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result using SVM produces better accuracy of model for the prediction at the temperature of 50°C. This 
provide an evident that SVM can give better prediction compared to FFNN model for limited training data. 
To improve the computational time of SVM parameters in future, external optimizations such as gravitational 
search algorithm (GSA), and Particle Swarm Optimization (PSO) can be considered. 
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